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Abstract 

Multiple comparison procedures that control a family-wise error rate or false dis- 
covery rate provide an achieved error rate as the adjusted p- value for each hypothesis 
tested. However, since such p-values are not probabilities that the null hypotheses are 



true, empirical Bayes methods have been devised to estimate such posterior probabil- 
ities, called local false discovery rates (LFDRs) to emphasize the frequency interpre- 
tation of their priors. The main approaches to LFDR estimation, relying either on 
numerical algorithms to maximize likelihood or on the selection of smoothing parame- 
ters for nonparametric density estimation, lack the automatic nature of the methods of 
error rate control. To begin filling the gap, this paper introduces automatic methods 
of LFDR estimation with proven asymptotic conservatism under the independence of 
p- values but without strong parametric assumptions. Simulations indicate that they 
remain conservative even for very small numbers of hypotheses. One of the proposed 
procedures enables interpreting the original FDR control rule in terms of LFDR esti- 
mation, thereby facilitating practical interpretation. The most conservative of the new 
procedures is applied to measured abundance levels of 20 proteins. 

Keywords: Bayesian false discovery rate; confidence distribution; empirical Bayes; local 
false discovery rate; multiple comparison procedure; multiple testing; observed confidence 
level 



1 Introduction 



Since the successful application of the false discovery rate to high-dimensional biological 



data (Efron et al. 2001), methodological research has taken two main directions in addi- 



tion to the hierarchical Bayesian direction in which a joint prior distribution of all unknown 
quantities is given. The purely frequentist line of research has continued to generalize the 



theorem of Benjamini and Hochberg (1995) for strictly controlling the false discovery rate 



and has resulted in methods of similarly controlling related quantities such as the number or 



proportion of false discoveries. (Dudoit and Laan (2008) supply a comprehensive overview 



of multiple testing in that tradition.) The empirical Bayes research stream has developed 
various methods of applying models that have random parameters as well as unknown, fixed 
parameters. The hallmark of the pure frequentist approach to multiple testing, as with fre- 



quentism more generally (Efron, 1986), is the provision of automatic procedures for data 
analysis with guarantees regarding their operating characteristics. In addition, frequentist 
approaches typically apply to small numbers of hypotheses as well as to large numbers. By 
contrast, the main advantage of the empirical Bayes approach is its ability to estimate the 
local counterpart of the false discovery rate, which is a posterior probability that the null 
hypothesis is false without invoking subjective priors. As a posterior probability, the local 
false discovery rate is easily interpretable and leads to asymptotically optimal estimation and 



prediction; see Efron (2010) for examples. However, that advantage comes at the expense 
of guaranteed error rate control and, in the case of nonparametric estimators requiring the 
tuning of smoothing parameters, at the expense of automation and applicability to smaller 



numbers of hypotheses (e.g., Efron, 2004). Fully parametric methods of estimating the 
local false discovery rate tend to require numeric optimization to maximize the likelihood 



function (e.g., Muralidharan 2010, Bickel, 2011). This paper draws from the strengths of 
each research direction by proposing an automatic estimator of the empirical Bayes poste- 
rior probability that may be applied to as few as two hypotheses without making strong 
parametric assumptions. 

Some notation will clarify the concepts. In testing N null hypotheses versus N alternative 
hypotheses, each of which is either true (Ai = 1) or false (Ai = 0), the ith null hypothesis is 
considered rejected if the statistic T, falls within some rejection region T . Every rejection is a 
discovery, a false discovery if the null hypothesis is true (Ai = 0) or a true discovery otherwise 
(Ai = 1). Thus, Nq (T) or N% (T), the number of true or false null hypotheses rejected, is 
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the number of false or true discoveries, respectively. Then iV + (T) = N (T) + N% (T) is the 



total number of discoveries (Efron, 2010). 



With the value of each A, unknown but fixed, Benjamini and Hochberg (1995) defined 
the false discovery rate (FDR) as 



E 



N (T) 
N + (T) V 1 J ' 



where the denominator is the maximum of N + (T) and 1. In other words, the false discovery 
rate is the expectation value of the proportion of discoveries that are false with the convention 
that the proportion of false discoveries is if no discoveries are made. While guaranteeing 
that the FDR does not exceed some critical level needs that seemingly harmless convention 



(Benjamini and Hochberg, 1995), the convention can cause fatal interpretation problems 



unless the probability of making at least one discovery is sufficiently high (Storey, 2002). 

A particularly simple and informative alternative to the FDR is the probability that a 
null hypothesis is true conditional on its rejection: 



$ (T) = Pr (Ai = o\Ti e T) 



E(N (T)) 



E(N + (T)Y 

Due to its association with Bayes's theorem and its modeling each Ai as a random variable, 



$ (T) has been named the "Bayesian false discovery rate" (Efron and Tibshirani, 2002), a 



Whittemore , 


2007 


Morris et al. 


2008) 



and since it suggests the fully Bayesian practice of assigning a prior to every unknown 
quantity. $ (T) will be called the nonlocal false discovery rate (NFDR) to distinguish it 
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from both the FDR and from the local false discovery rate (LFDR), 



( f ) (t i ) = ^({t i }) = Pi(A i = 0\T l = U), 



(1) 



with ti denoting the observed realization of Tj. The LFDR is closer to Bayes-optimal than the 
NFDR in that it is conditional on the observed statistic rather than merely on the event that 
the statistic lies within T. In addition, the LFDR is intuitively appealing as the probability 
that the null hypothesis is true given the reduced data. 

Thus, the primary reason for introducing conservative methods of NFDR estimation for as 
few as a single p- value in Section [2] is to repurpose them for conservative LFDR estimation 
for as few as two p-values in Section [3} Section [4] features an application to testing 20 
hypotheses on the basis of proteomics data. The simulation study of Section [5] quantifies the 
performance of three of the new LFDR estimators for various finite numbers of hypotheses. 
Finally, Section [6] provides a brief discussion, and Appendix A collects proofs omitted from 
previous sections. 



2 Estimation of nonlocal false discovery rates 

Let 7T = Pr (A = 0), II (T) = Pr (T< G T), n (T) = Pr (T, G T\Ai = 0), and IT (T) = 
Pr (Tj G T\Ai = 1). By Bayes's theorem, 



$(T)=Pr (Ai — 0\Ti G T) 



7r n (T) 

n(T) 



(2) 
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which is often estimated by substituting 1 for 7Tq and II (T; N + (T)) = N + (T) /N for II (T): 



$(T;iV + (T)) 



no CO 



TL(T;N + (T)) 



Al, 



(3) 



the minimum of il (T) /II (T; X) and 1. If the test statistics are independent of each other, 
X = N + (T) follows the binomial distribution with parameters N and II (T), and II (T) is 
the maximum-likelihood estimate (MLE) of II (T). Thus, $ (T) is the MLE of II (T) /II (T), 
which is no less than $ (T). 



This estimator also provides a convenient statement of the Benjamini and Hochberg 



(1995) method of controlling the FDR at level q: in terms of upper-tailed testing, 



A; 



l if ti > t (g) 

if ti < t (q) 



(4) 



where t (q) = inf -j ti : i E {1, . . . , N} , $ ([tj, oo)) < q }>, Aj = 1 indicates rejection of the ith. 

Corollary 4.2). The 



null hypothesis, and Ai = indicates its acceptance (Efron 



2010 



practical importance of that relationship is discussed in Section [6j 



The independence model facilitates the derivation of confidence intervals (Efron, 2010). 



For C £ [0, 1] and a realization x of X, let Sc and Sq 1 denote significance and inverse- 
significance functions such that 



S c (n (T) ; x) = Pr {X > x; IT (T)) + C Pr (X = x; II (7*)) ; 



5 c 1 (^(n(T);a;);x)=n(r) 



(5) 
(6) 
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where Pr (•; II (T)) denotes the binomial distribution with parameters N and II (T). Then 



the standard binomial, one-sided (1 — a) 100% confidence intervals for II (T) (Clopper and 



Pearson 



1934) are [0, Sq 1 (1 - a; x)] and [S 1 1 (a; x) , l] . They are valid confidence intervals: 
Pr (S^ (1 - a; X) > U (T)) > 1 - a 



Pr (Sf 1 (a; X) < U (T)) > 1 - a. (7) 

Since A{ rather than the uncertainty of $ (T) is of direct interest, the main value of the 
confidence intervals is in the construction of better point estimates of $ (T) and thus oil — Ai 
for all i satisfying Tj G T. A point estimate $* (T; x) that satisfies Pr ($* (T; x) > $ (T)) > 
1/2 for all 7To, IIo (T) , II (T) G [0,1] is called a median conservative estimator of $(T). 
According to the following proposition, one such estimator is <3> (T; x) = $i (T; x) , the 
C = 1 special case of 

Each $c (7~; x) is called a confidence-posterior median of $ (T) since it is a median of $ (T) 
considered as a function of a random binomial parameter of distribution function S^ 1 (»;x) 



(Bickel, 2010a). $ (T; x) may be considered as a conservative correction to the MLE, as seen 



m 



Fig. i 



Proposition 1. Under the independence of T%, Tjy, the random quantity $(T;X) is a 
median conservative estimator of $ (T) . 

Proof. Independence entails equation ([7]), which implies that 



Since, by formula @, $ (T) < n (T) /II (T) and since n (T) /II (T) < 1, it follows from 
equation ([8]) that $i/ 2 (T; X) is median conservative: 

Pr ($(T;X) > $(T)) > 1/2. (9) 



□ 



That the MLE is not median conservative is evident from the left-hand sides of Figs. [2J[3] 
for the N — 1,2 cases: Pr ^$ (7^; X) > $ (T) j < 1/2 for some combinations of the test-wise 
error rate a and the discovery probability II (T a ), where T a is a level-a critical region such 
that n (T a ) = ct. For contrast with the corrected estimates given by the confidence-posterior 
median $(T;x), the right-hand sides of Figs. [i]j3] illustrate formula (|9]), also for N = 1,2. 
Accordingly, $ (T; X) will be called the corrected estimate of the NFDR. 

The expectation value of a random quantity with respect to Sc (•; x) as the distribution 
function of the random binomial parameter is called a confidence-posterior mean. For ex- 
ample, writing IT' as the dummy variable of integration, the confidence-posterior mean of 
IT (7~) is J WdSc (IT; x). Likewise, according to equation (|2]), the confidence-posterior mean 
of$(T), 

$ c (T; x, 7Tq) = / f ^y^ dS c (W; x) , (10) 



IT 

is a Bayes-confidence-posterior probability that = given Tj G T. As such, it rivals 
the hierarchical Bayes approach to accounting for the uncertainty in $ (T) and is com- 
plete with a decision theory based on minimizing expected loss (Bickel, 2010bfa ), and yet 



without requiring a hyperprior distribution. In practice, 7r will again be set to 1, yielding 
3>c {T; x) = $c (T; x, 1), the confidence-posterior mean of Ilo (T) /II (T), as an upper bound 
of the confidence-posterior mean of $ (T). 



1 of 1 null rejected 



1 of 2 nulls rejected 




0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 

MLE MLE 



2 of 2 nulls rejected 1 of 3 nulls rejected 




0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 

MLE MLE 

Figure 1: Nonlocal false discovery rate estimates. The corrected estimate is $ (T; x), the 
confidence posterior median, and the MLE is $ (T; x), the maximum likelihood estimate. 

While median conservatism is a finite- iV property, concepts of asymptotic conservatism 
become prominent in the results of the next section. A random variable 7 (X) is a con- 
servative estimator of some constant 7 if limjv-s>oo P r (7 (X) > 7) = 1. Likewise, J (X) is 
a conservative predictor of some random variable 7 (X) if liniTv^oo Pr (7 (X) > 7 (X)) = 1. 
The estimators of the NFDR considered above are conservative, as will be proven in Ap- 
pendix A: 

Lemma 2. IfT 1: ...,T N are IID, then $ (T;X) ; the members of |$ c (T; X) : C G [0, 1] j , 
and the members of {Ec (T) ; X, 1) : C G [0, 1]} are conservative estimators of $ (T). 
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1.0 




Figure 2: Probability that the MLE $ (T; x) [left] and the corrected estimate $ (T; sc) 
[right] is at least as high as the upper bound of the nonlocal false discovery rate when 
N — 1. Here, "TWER" is the Type I test-wise error rate, and "discovery probability" is the 
probability of rejecting any given null hypothesis. 




Figure 3: Probability that the MLE $ (T; x) [left] and the corrected estimate $ (7~; x) 
[right] is at least as high as the upper bound of the nonlocal false discovery rate when 
N = 2. Here, "TWER" is the Type I test-wise error rate, and "discovery probability" is the 
probability of rejecting any given null hypothesis. 
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3 Estimation of local false discovery rates 



3.1 Additional notation 

Let p designate a one-to-one, monotonic map from each statistic to a p-value such that 
Pi = p (ti) is the p-value that corresponds to the ith null hypothesis, which would be rejected 
ii Pi < a for some Type I test-wise error rate a G [0, 1]. Thus, T a is constrained to satisfy 
T a = {t : p (t) < a}. (The requirement that p be invertible does not rule out two-sided 
tests since they can be equivalently formulated as one-sided tests by transforming the test 
statistic. A two-sided t-test will be used in Section |4j) 

Denote the random p-value of the ith null hypothesis by Pi = p {Ti). The order statistics 
of pi, . . . ,p N and Pi, . . . , P N are p^, . . . ,P(n) an d P(i), • • • , P(n), respectively. In the same 
way, Ti is the rank of pi among the other observed p- values, and Ri is the rank of Pi among the 
other random p-values. The presentation of the methodology is simplified by ensuring that 
ties do not occur in p±, . . . ,Pn, achievable by breaking ties with a pseudorandom-number 
generator, and that they occur with probability in Pi,...,P/v, which follows from the 
stipulations that Tj be a continuous random variable and that the T\, . . . , T/v be IID. Hence, 
p (ri ) = pi and Pr (P(R i ) = Pi) = 1 for alH = 1, . . . , N. 

For economy of notation, $ (a) = $ (T a ) and 

<p(p) = Pi (Aj = 0|p (Ti) = p) 

respectively denote the NFDR and, for any p G [0,1], the LFDR. Since ti = p -1 (pj) 
for any i = 1,...,N, each LFDR agrees with equation ([!]): tp (p,^ = (p(ti). Similarly, 
Nj (a) = Nj (Ta) [j = 0, 1] and iV + (a) = N + (T a ) are the conditional and marginal numbers 
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of discoveries, and IT, (a) = IT,- (T a ) [j = 0, 1] and n + (a) = U + (T a ) are the conditional and 
marginal probabilities that Tj 6 T a . Lastly, $* (a) = $* (a; iV + (a)) = $* (7^; iV + (7^)) will 
represent an estimate of the NFDR, where the function $* may be $c> or $c- 



3.2 Conservative LFDR estimation 

The LFDR (p (jpi) will be estimated by the NFDR estimated with a equal to the p-value 
of twice the rank of p^ if possible or estimated by 1 otherwise. That is, given $* as the 
estimator of the NFDR, <p (p^ is estimated by 



<P (r»; & 



$*( P(2ri) ;iV + (p (2n) )) ifn<f; 



if r, > f . 



For example, the MLE, the corrected estimate, and the confidence-mean estimates of the 
LFDR are 

(p {n) = ip (rf, $) = $ (p (3ri) ) ; (11) 

£ fa) = (r,; = $ (p (3ri) ) ; (12) 

<fc (n) = ip (n; $ c ) = $c {P(2n)) 

for any < N/2 and (r^) = tp (r^) = <£>c (r,) = 1 for any r$ > iV/2. 

The theorem stated below establishes a sense in which an LFDR estimator is conservative 
under general assumptions, including one involving the following conditional version of a 



definition of skewness attributed to Karl Pearson (Abadir, 2005). The Pearson skewness of 
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a random variable Y, conditional on event £ is 

E(Y\£) - median (Y\£) 



skew (Y\S) = 3- 



Let / and F respectively denote the probability density and cumulative distribution functions 
of Pi for each i = 1, . . . , N. 

Theorem 3. Assume that Ti, ...,Tjv are continuous and IID and that ip : [0,1] — > [0,1] is 
monotonically nondecreasing. //$* (a) is a conservative estimator q/$ (a) and skew (tp (Pi) \Pi < 
for any a G (0, 1] ; then ip (Rf, $*) is a conservative predictor of ip (Pi)- 



The proof will appear in Appendix A. Basu and Dasgupta (1997) reviewed various sets of 



sufficient conditions for E (Y) > median (Y) (nonnegative Pearson skewness, skew (Y) > 0). 
This corollary of the theorem follows readily from Lemma [2j 

Corollary 4. Under the conditions of Theorem^ p (Ri), the members of{ip>c {Ri) ■ C E [0, 1]}, 
and the members of {ipc (Ri) : C £ [0,1]} are conservative predictors of ' <p (Pi)- 

Stated less formally, the proposed maximum-likelihood LFDR estimate, corrected LFDR 
estimate, and bound on the confidence-posterior-mean LFDR conservatively estimate the 
LFDR given a sufficiently large number of hypotheses. 

Although ip = p>(») is monotonically increasing, ip (•; $*) in general is not: LFDR esti- 
mates do not necessarily preserve the order of the p- values, which is the order of the actual 
LFDRs. Thus, in the next two sections, the monotonicity of the estimates, 

p(n;$*) <•••<¥> (7*;$*), (13) 



is enforced by this algorithm used with step-down multiple comparison procedures (Westfall 
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and Young, 1993, Dudoit and Laan, 2008): (p (r 2 ; $*) is changed to ip (r x ; $*) if </? (r 2 ; $*) < 
<f (ri; <&*), then </? (r 3 ; $*) is changed to (r 2 ; $*) if y? (r 3 ; $*) < (r 2 ; $*), etc. Since such 
monotonicity enforcement cannot decrease the estimates, conservatism is maintained. 



4 Application to proteomics data 

Levels of 20 proteins were measured in 90 women with breast cancer (55 HER2-positive and 



35 mostly ER/PR-positive) and a group of 64 healthy women. (The data (Li, 2009) are 
from Alex Miron's lab at the Dana-Farber Cancer Institute.) To approximate normality, the 
abundance levels were transformed first by adding the 25th percentile over all the proteins 
and over all the healthy women (yielding positive levels without a hard threshold) and then 
by taking the logarithm. 

For each cancer group (HER2 and ER/PR), there are 20 null hypotheses of no mean 
difference in the transformed level between cancer and healthy groups. Fig. [4] plots £>(rj) 
against pi, the p- value of the two-sample t-test with equal variances for the null hypothesis 
that the iih protein has the same expected abundance level in a cancer group as in the 
healthy group. Each displayed estimate of the LFDR is easily interpretable as a conservative 
estimate of the posterior probability that a given protein has the same average level of 
abundance in a cancer group as it does in the control group. 



5 Simulation study 

Let xl s denote the noncentral \ 2 distribution with 1 degree of freedom and with noncen- 
trality parameter 5. Many test statistics are asymptotically x\si with S = under the null 
hypothesis and 5 > under any alternative hypothesis. Notable tests with such statistics 
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Figure 4: Corrected estimate of the local false discovery rate versus the p-value for the 
application to proteomics data. 

are the likelihood ratio test with a scalar parameter of interest and a local alternative hy- 
pothesis and two-sided tests with asymptotically normal statistics of unit variance. As a 



result, this limit is highly relevant to problems in modern biology (Bickel, 2011), including 
that of Section HI 

Consequently, each simulated data set consisted of N test statistics independently drawn 
from xl o with probability ttq and from x\ 2 with probability I — ttq for each of four values of 7To 
and for each of five values of N. For every (71*0, N) configuration, 100 independent data sets 
were generated. In Figs. [5j [6| and [7| the LFDR estimates are based on NFDR estimates for 
% = 1, . . . , N: the "MLE" tp (r^) estimates the NFDR by equation ( 11 ), the "expectation value" 



(pi/2 (rj) is based on the 7r = 1 upper bound of the confidence-posterior mean of the NFDR 



(10) with C = 1/2, and the "corrected estimate" (pfa) is based on the upper confidence- 
posterior median of the NFDR. (Approximation of each value of <pi/2 (r») was achieved by 
drawing 100 independent Monte Carlo samples from Sy^ (•; x) via S^h (U; x) , U ~ U (0, 1) 



and by averaging according to equation (10). 
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Figure 5: Root mean squared error in local false discovery rate estimation versus N. Each 
of the four panels corresponds to a different value of 7i . 



6 Discussion 

Compared to previous estimators of the LFDR, the main advantages of the proposed methods 
are their proven conservatism (Theorem [3]) and their applicability to very small numbers of 
hypotheses without strong parametric assumptions. The algorithms are simple, requiring 
neither numeric likelihood maximization nor nonparametric smoothing procedures. The 
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Figure 6: Conservatism in local false discovery rate estimation versus N. Conservatism is 
measured by the proportion of estimates that exceed the local false discovery rates they 
estimate. Each of the four panels corresponds to a different value of 7T . 
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Figure 7: Arithmetic bias in local false discovery rate estimation versus N. Each of the four 
panels corresponds to a different value of 7Tq. 
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algorithm for ipfa), the proposed MLE, is particularly simple, being only slightly more 



complicated than the FDR-controlling procedure of Benjamini and Hochberg (1995). 



In fact, (p (ri) can shed light on the practical interpretation of applications of that FDR 
procedure. From equations ^ and (11), it can be seen that the value q at which the 
FDR is controlled for a set of rejected null hypotheses is equal to cp (r^q)) when violations 



of monotonicity (13) are neglected, where i (q) is the index such that is the p- value 
equal to the median of the p-values in the rejection set. Since <p \ Ti( q )) = q is simply a 
conservative estimate of the LFDR corresponding to that median p-value, the lowest half 



of the p-values of the hypotheses rejected by the Benjamini and Hochberg (1995) procedure 



have conservatively estimated posterior probabilities of truth less than or equal to q. 

While Theorem [3] guarantees conservative performance only for sufficiently large numbers 
of hypotheses, examples of finite- iV applications were provided in the proteomics case study 
and in the simulation study. That the proposed methods conservatively estimate the LFDR 
is evident from the proportion of estimates exceeding the true value (Fig. [6]). The slightly 
negative arithmetic bias sometimes seen (Fig. [7]) results from forbidding estimates from 
exceeding 100% rather than from any anti-conservatism. Fig. [5] illustrates how the overall 
performance of the estimators, owing to their conservative nature, perform better for higher 
proportions of true null hypotheses. 
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Appendix A: Additional proofs 



Proof of Lemma 



By the definition of a conservative estimator and by equation (|2]), any random variable of 
the form 

n (T) 



$* (T; X) 



A 1 



IT (T; X) 

is a conservative estimator of $ (T) if the random variable II* (T; X) converges to II (T) 
in probability since n < 1. The estimators $ (T; X) and, for any C G [0,1], (7~; -X") 
are of that form with IT(T;X) = fi(T;X) and II* (T ; X) = 5 C 1 (1/2;X), respectively. 
The convergence of II (T; X) to II (T) is guaranteed by the weak law of large numbers. 
Since S^ 1 (1/2; X) is the median of the random variable that has Sc (•; x) as its distribution 



function and since Sc (•; x) is an asymptotic confidence distribution in the sense of Singh 



et al. (2007), a sufficient condition for its convergence to II (T) is that fixed-level confidence 



intervals formed by Sc (•; x) degenerate to a point as N — > oo (Singh et al. , 2007, Theorem 
3.1). That condition is met since Sc(*',x) is defined by equation (|5]), consistent with the 
confidence intervals of Clopper and Pearson (1934). Thus, the conservatism of $ (T; X) and 



$c (7~; X) are established. 

Similarly, because U (T) /II (T) > $ (T), the conservatism of $<7 (T; X, 1) follows from 
its convergence to IIo (T) /II (T) in probability. Since $c (T; X, 1) as defined in equation 



( 10 ) is a confidence posterior mean of IIo (T) /II (T), its convergence to IIo (T) /II (T) follows 



from the two conditions of Singh et al. (2007, Theorem 3.2) 



1. fixed-level confidence intervals formed by the asymptotic confidence distribution of 
IIo (7~) /II (T) degenerate to a point as iV — > oo; 
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2. the confidence posterior variance 



n (71 

IT 



*c (T;X,1) 



2 



dS c {W;X) 



is bounded in probability. 
The first condition results from the monotonicity between Hq (T) /II' and IT in the integrand 



of equation (10), in which il (T) is fixed, and the fact that, as argued above to establish 
the conservatism of $c (7~; X), the degeneracy condition is met for Sc (•; x), the asymptotic 
confidence distribution of II (T). The second condition follows trivially from the fact that 
the domain of Sc is [0, 1], thereby establishing the conservatism of $c< (T; X, 1). 

Proof of Theorem [3] 

Since $ (a) = E (<p (Pi) |Pj < a), the nonnegative-skewness condition implies 

$ (a) > median (<p (Pi) \P { < a) . 
Thus, defining the variables P[ and PL to be IID with P 4 and P^, respectively, for i = 



1,...,N, 



$ {P m) ) > median (ip (P[) \P[ < P {2Ri) ) 
almost surely. The monotonicity of ip implies that, almost surely, 

median (ip (P[) \P[ < P {2Ri) ) = median (ip (P/) \<p (P[) < ip (P {2Ri) )) ; 



jim Pr (median (ip (P/) |P/ < P (2H() ) = ^ (P {Ri) )) = 1. 
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Because the conservatism of $* (a) means lim^v-^oo Pr ($* (a) > $ («)) = 1, 



= lim Pr ($* (P {2Ri) ) > median ( v (/>') \P[ < P (2Ri) )) 

iV— >oo 

= J™ Pr ( $ * ( p ( 2 ^)) ^ ^ = V? (Pi)) • 



25 



